Picture for Xing Sun

Xing Sun

Moment-Video: Diagnosing Temporal Fidelity of Video MLLMs on Momentary Visual Events

Add code
Jun 01, 2026
Viaarxiv icon

MoG: Mixture of Experts for Graph-based Retrieval-Augmented Generation

Add code
May 29, 2026
Viaarxiv icon

Toward Native Multimodal Modeling: A Roadmap

Add code
May 25, 2026
Viaarxiv icon

Performance-Driven Policy Optimization for Speculative Decoding with Adaptive Windowing

Add code
May 14, 2026
Viaarxiv icon

Video-MME-v2: Towards the Next Stage in Benchmarks for Comprehensive Video Understanding

Add code
Apr 06, 2026
Viaarxiv icon

HISA: Efficient Hierarchical Indexing for Fine-Grained Sparse Attention

Add code
Apr 01, 2026
Viaarxiv icon

MHPO: Modulated Hazard-aware Policy Optimization for Stable Reinforcement Learning

Add code
Mar 14, 2026
Viaarxiv icon

Deep Tabular Research via Continual Experience-Driven Execution

Add code
Mar 12, 2026
Viaarxiv icon

Can Unified Generation and Understanding Models Maintain Semantic Equivalence Across Different Output Modalities?

Add code
Feb 27, 2026
Viaarxiv icon

DenseMLLM: Standard Multimodal LLMs are Intrinsic Dense Predictors

Add code
Feb 15, 2026
Viaarxiv icon